1 gctcacagac cgggactccg cctccggttc ccgagggcgt ggcgaggcgc 

51 tgcgggacgc ccaacaggtg cgtgttgtgt ccccaggccc cgcgctccgg 

101 gtggagtcaa gagcctggaa gccggcagcc cgggaaaagg gggcgggacg 

151 gtgccccggg gcagggctgg gtggcggccg ctgtcctccc gggaggggcg 

201 ggccgcctcg acgccgccct ccctggcggc caatggagac cgaggccccg 

251 cgcctggatt ggagcggacg cgggggtcag ccagccttgg gggccggggc 

301 ctggccgggg gcgggggggc aggcgaggcg aggcgggcgc cgtccgcgcg 

351 at tataao Qc ggggagttcc ctgcgccgcg agccgggagg cgcacgctcg 

401 ctcgtacggc ggccgcggcg gcagggcggg gccggagcag cgggcggcgg 

451 cggaggcggc gcccgggagc actCTTCGCT TCCCTCGGGG TCTTGCTCGG 

501 ACCTCGGCCA CCGCCTGGGA TCCCCAGGAC TCGTGCGTGC AGCATGGGCG 

551 GCGTCGGGGA GCCGGGACCG CGGGAGGGAC CCGCGCAGCC GGGGGCACCG 

601 CTGCCCACCT TCTGCTGGGA GCAGATCCGC GCGCACGACC AGCCCGGCGA 

651 CAAGTGGCTG GTCATCGAGC GCCGCGTCTA CGACATCAGC CGCTGGGCAC 

701 AGCGGCACCC AGGGGGCAGC CGCCTCATCG GCCACCACGG CGCTGAGGAC 

751 GCCACGataa ggaagccata aggaagccac ccaccggcgg gtggagcctg 

801 gagctcggtc gtgggcgtga tgtcccgctc cacctgtggg gccttagcat 

851 cctccctccc ctcgctgacc tttgacctcc acgccgggac ccagagttgg 

901 ggtggactag ccagggccag atgtggggta gggagggcag ttccctgcgt 

951 ggaggacccg cagctgtcca cggagcaggt ctgcggggga ggagggggcc 

1001 tcagaggtgg gtgtgtcatg ctgcagagcc tgccctgggt gaggggctgc 

1051 cctgttgctc ccaggtccct gtttcagttc tgggtcccca tgctgggtgc 

1101 ttgctgagtg ctaggggtag ggcagggcag ggtccccagg ggccggtaag 

1151 gacatgccat tagaggctgg gggctgggcc ggcctgaggt ctgtggcttt 

1201 cccaagagct tctgtaaagg gctcagggac agtgactcac ctctccgggc 

1251 tagcagctgc acgtgggagg gctttgccag ccaggctggg tgggcctctc 

1301 ctggaagcac agtcacccca ggaacaggct ggcccctggg gaccccaact 

1351 tcccaatccc agcccctgtc tagacaggca gggatgtagc ctggccccag 

1401 ggtactgtct ggctggagtc cagtggtgga gcagcccgac cagccccttt 

1451 tccttagtta cccacctgca taataggggt tggggccacg atgccctgtc 

1501 cttgaccctc caaatttcta ggttggccac actgggtatc aggaaggtct 

1551 tcaagacccg aggacatgaa tcctgaatgc tggctttttg ggcagcagcg 

1601 gaggttctgt ccagtcccag gactgtcggc gtccctcttg ccagggccac 

1651 ctgctctctg ccgattgcca tctccagcat gttggacaat cttcactgga 

1701 ctctttgagg aagaaagccc ctcttttccc tttccacccc atgaagctga 

1751 ggagtgagaa taagaatcct cctgaaattc taaaaaaaga aaaaaaaaaa 

1801 aaagagaacg ccttgtccgt ggctgttcag gcgccagacg ctggcccgag 

1851 gggacagcac agccgtggga tgaagcagcc tgggggcagt atttgagcgt 

1901 gcaggtgttt gcatgtctgg gtgagtgtgg tgtgtgtgcc tgcctttctg 

1951 ccagggcgtg gcgaggtgag gggcacggct tctccccaaa ggccttgctg 

2001 agccctggcc tcccttcaag gagtcttgtg gatgcctgct ctggtctttt 

2051 tttaaaaaag tatctatttt atttattatt atttgtttaa aaatagagac 

2101 agggtctcac tatgttgctc gggctggtct caaagtcctg ggttcaagca 

2151 ttcctcctgc ctcagcctcc gaaagttctg ggattacagg catgagccac 

2201 cactcccggc ctgctctagt cttttgtaac ctagaggaca gtatggatac 

2251 agaaaacttt actccccacc aaccgccgga gacagagtct tgctctgcca 

2301 cccagactgg agtgcaatgg cgccatcttg gctcactgca acctccgcct 

2351 cccaggttca agcgattctc ctgcctcagc ctcccgagta gctgggatta 

2401 cgggcacgcg ccaccacgcc cagcatattg tatttttagt agagacgggg 

2451 tttcaccatg ttggccaagc tggtctcgaa ctcctgacct cgtgatccac 

2501 ccacctcggc ctcccaaagt gctgggatta caggcgtgag ccaccacgcc 

2551 cggctgggat acagaaagct tttatttcat cactgtttcc tgcctggtgc 
2601 caggcccatg ctggggttcc tcccaagtgg aattactgac ttaacattta 

2651 gcttgggatc ctgagacttc catcacacag ttttctcatt gattcgcagc 

2701 caataatatc tgttttaaaa acatctcagg ccgagcgctg tggctcacac 

2751 ctgtaatccc agcactttgg gaggctgagg tgggcagatc acctgaggtc 

2801 gggagtttga gaccagcctg accaacatgg agaaaccctg tctcttctaa 

2851 aaaaatacaa aattagccag gcgtggtggc gcatgcctgt aatcccagca 

2901 ctttgggagg ctgaggcagg agaatcgctt gaacccagga gacggaggtt 

2951 ccggtgagcc gagatcgcgc cattgcactc cagcctgggc aacaagagca 

3001 aaactccgtc tcaaacaaac aaacaaaaaa catctctctg ctccttgggg 

3051 ccgggtgcca gctctgctat tggaggcact gagcgacctt gaagcaggca 

3101 tgtcactcct ctgtgcccca gtttactcat ctgtaaagtg ggagagctgg 

3151 ggcagacagt gagctggctg agggcaggac tgtgtctcct caagcccatg 

3201 gcccagggct gccaggtagt agtttgtatt cggtaaatgc tgctggcccc 

3251 taagtgtgag cgtgccctgc aaactgcagc gtatggtggg acagccctgc 

3301 acggctaccc ctttcctggg tgaccttatt tggttacggt cctatctgaa 

3351 gtaggaaagg gacactttag gctgtctctt agctccctca aggccccaca 

3401 gcctggacta gagttgccag aaatacttgg tccattcagg ccaaagggac 

3451 tgtgaggttg ctgggatggt gcaatcagtc tttgtccatg atgaacccac 

3501 agggtagacc aggggttggg ccagcccagt gccctgtgta gttgagccca 

3551 ggccccaggc atcccatccc gggcggtggc ctcaggtgga ggtggggcag 

3601 ccagttgcca gggatgtgtt ccagcggtca cctctcacca gccccggctg 

3651 cccatcagct gttctcaagt ccaggcaatg aagccttcct gccaggaaat 

3701 tcccagagtt tctgtgccat gaagtcagcc tgtggccatc ttgggacaca 

3751 aggccgggtg ccctggggag agtactctgg gcccttggcc aggtttgtct 

3801 gagagtcata ggcagcctga tactagtgga gccagccagg gagggatgag 

3851 gcccagccgc tgctggccat aagtatataa gggccatgtg ctgagtgcct 

3901 actatgtgcc aggttttgaa atcagtactt gatttattga aaccctctct 

3951 tttaatcctc aaggtgcccc tatgaggcac gtaccattta ttgttattgc 

4001 cacttgacag atgagaaaac agaggctcag agaggcaaag tggcttgaaa 

4051 ttcagtgatt ggtctgggat ttgaatccac agccatgttc ttaagggcat 

4101 gctatgctgc cacctatcct gtttatttcc ggcactcatt gattcttcaa 

4151 tgtttgactc attaaatcca tcagtgagca tcttctctgt gtcatgcatg 

4201 gttctcacct ctgaagatgt agctgtgagc aaaacttcta cagggaatga 

4251 gttcacagca gagggatcag ctagagcaaa ggctcagagg tgggaccgtg 

4301 cgtcctgtgt tccaggaata cagtatggct gcagcagaga gcagtggaga 

4351 gagggcctgg cagtgaggtc tagaggcggc cgggctggct catgctggat 

4401 gtttgtgtcc tcggaaggac tttggcttta ttttaaagag gatggggagc 

4451 cccagagagc acagcaggga agcctgggga gtctgatgga catttaaaag 

4501 gatccttaat ggagagagtg aaggcagagc cttccagaag ggtaagagaa 

4551 gggaggatgg agacctgccc tcccccaagg gaggccactt agaagaggta 

4601 gagtgtggcc agggcagaga gcaagagagg ctgtggacac aggcacactg 

4651 gtccagtgag agccattaga cacattagat ttagcttcat gttgtcttta 

4701 gagagggagc cagcctggcc tcgctctatg atcttggaca catcctttca 

4751 cttctgggtc tcagtttccc cattagtgtg atgaggatga gaatgctttt 

4801 gtcctgggca cactatgagg gtggtgctgg gcacctgggt gcctggttac 

4851 catgggcaac aaagctctat tcatgggtgt ggtgaatgca ttgcccacag 

4901 caactcaggg cggatgagga gtttcccagc agcccctggt gccctttcgg 

4951 ctgaagccct aacaactgtg ggaaaatcca agttccagca gaccccctga 

5001 gccctctgcc ttaggaccct ccttctaggt ggttctctga gcctggcctg 

5051 agctggagga gggagtggcc agtgctgcag cagaggctgc ttcatagtaa 

5101 ttgcagccaa cagttattga ctaggcactg ttctgagggg tttagatgtg 

5151 gtaactgatt gaattcgcct aacaacttta tgaggtaagt cctattgtta 

5201 gcccattttg tagatgagga gactgagttt gaaactgggg ggtgtaatgg 

5251 aaccttctca ggacccttga agggtagggc ctttgtactc gggccacgag 

5301 ggtggggttt gtgtctgggt gggagctggg gagggacagg actaggatta 

5351 ggcagatctg aggccacagg agttggttgg ggggtggctc cagagccact 

5401 ccactccctc ctaccacatt gactgccttg aaagtcccct aatggccact 

5451 cccatgaagt gtgactgctc tgggctcccc gcaggcgttt tctgcaaggc 

5501 caccgcccac ccaggcccct tccccagagg ggctgcagtg ccttgctcct 

5551 tccttgtggg aagagttggg attgtctggc gtcagcagga tactgcccct 

5601 gggcatccct cccggtctct tcctgcgggt ttctgatgaa acagccaggc 

5651 tccagtagtg gagccagagg tcagtggtgg agagaggacc aggagccaga 

5701 gggtatagct gctttggggc tactgtgggg tcagggacac ttgtgaggcc 

5751 aagcgtcctg gctgcaggag ccctcacata tatgcccacc cttcaccagg 

5801 acattgaggg gtgctggggg acaggggtag ctttttgggg gtgtctgcct 

5851 tcgacttggg ctccgctaca caggccaaat ttggatgtcc catgtttaga 

5901 gctgtgtttc tttgggacct cttggggcct cagtttcctc atctgtaaaa 

5951 tgggatactg atagtgcttc cccactggcc tcctctgacg ggcgccaggg 

6001 agaggatggg acggagcatg gtgtgctggg cacgctcctg ctgtacccac 

6051 ccacctggga gaggggagag gcaggaatgt cctgggggtg tcctttgagg 

6101 catagccctg tcaccccaac atcctacaaa ggcatgagaa ggcagcgagg 

6151 acagaccccg accacctgag ccctcagcag ccctgccaca ctccctgctt 

6201 cacccccttc ctgactgatc tggcacattc ttgattctcc tagggagtga 

6251 cccaaaatcc ctccctgccc tgctgtgtct ctggggtgga aggaggctgc 

6301 cagcccctcc tctctcccag cctcaggctt ggccaggact taacaggcag 

6351 gcagagaagc agcttctcca ctctcttccc tgacacctgt aggcccctcc 

6401 tgcaggcact tacctctaag tggactctca ggaggaggct catcagggct 

6451 gcagggctca gaaagagctg ggctgtggag ctcttgccaa ccgccaggcc 

6501 ccttctaagt gctttagcgc caccgactgc atcctcccag cagccttgtg 

6551 agatggggat ttgtggttcc cagtttactg atgagaaata ctgatgagag 

6601 atgggtgtgg tcttgtctgg ggctccctgg ctcctggata gcagctcagg 

6651 ttccatcctg ggcaggctgg ctctgggaca cccccccgac cagctgctgt 

6701 gtgggattca cggtggggct tgggcagggc gtgggatctt ggggccaact 

6751 gagccactct aggcttccag ggaccaaggc caggctgagc tgtctctgta 

6801 tcctgagaga gcatgaacat cacagaagat gggcccgggt tcgaatccca 

6851 gctctgccac tactaactgg gacctgggca ggggtccctt cccgctgagc 

6901 cttcatttcc tcaccagcaa aatggttcgt gcccctgctt tgggggctgt 

6951 ggagggttgg ctcttgtcta cttgttcata cctgctgttg agcagctgct 

7001 ctgtgccggc ctctgaggat gccactgtga acagagcctg tcgctacctc 

7051 caggagcttg tgtttagggg tgccgttttg attccagcac tttcacccag 

7101 ctctgctccg gtacccgatg agagacgtcg agtgccgctt tccactcgct 

7151 tgggtgcgtg tgggggttgg ggggacaggg ctttgtgcac gtagccctgg 

7201 gtggatgttc ctgggtgcac ttagggtgtg tgagggtggg acctcccaca 

7251 gttccctgag gctccactga tgaggtccaa gaaccgcctt cctgcccccc 

7301 agcccaggct cccagcagct gggcccttgg cttcttgaga tagtgactgg 

7351 cctcacggca aggacccccg cacaccacct aggagaactg ctgcttcccc 

7401 tctgttccag gagtggcgac aagcacagtt tttcgctttt gtttttgttt 

7451 tcttcacttt aagttccggg aaacgtgcag aatgtgcagg tttgttacat 

7501 aggtatacat gtgccatggt ggtttgctgc acccgtcaac ccctcatcta 

7551 ggttttaagc tccatataca ttaggcattt gtcctaatgc tctccctccc 

7601 cttgcccctc acccgcccag taagccccgg tgtgtgatgt tcccttccct 

7651 gtgtccatgt gttctcattg ttcaactctc acttatgagt gagaagagac 

7701 ctggactctg atctaacctc ggtcaaatgg aactgtgtga ccttgaagaa 

7751 gtagcttaac ctctctgagt cttagcttct gcctggcacc cccatcctta 

7801 aggagaggcc cacagaggac caggtcacat gacctcagcc agttccagag 

7851 aaggctgttt gcttccaggt ttcggcctga gtccaggccc ctgccctact 

7901 cgcactccct gatagcatga gaagcacagc cccagggtgc ccacccagct 

7951 ctgagagccc agcctgcttc ccagggaact gtcacagccc cacctgtccc 
ctgaggtggg 
acatagtgag 
ggagccctgt 
cacacttccc 
ggcaggcttg 
gggtgaatca 
aagatcctca 
gggaagggct 
gttgcgccgg 
gtgcaaggct 
tgtttctgag 
ctgcatgtgc 
ccatcacgct 
caacatggat 
caaaggaaca 
gatctgtatg 
gagaatggag 
tgaaaaggtt 
tgcacttaat 
tttcatgtat 
cttatgcctg 
tgagctcagg 
ctacgaaata 
ccacctactc 
gaggttgcag 
gagcaagact 
caggtgtggc 
aggatcgctt 
accttgactc 
gcctgtagtc 
ccaggatttc 
ctggtgacag 
gggctgggcg 
aggtgggtgg 
ggcaaacctc 
gtggtgcaca 
caccfcgagcc 
cactctagcc 
aaaaaaaaaa 
ttccctccca 
ggcccagaca 
taccagtttt 
tctgcactta 
aggttattcc 
ctgggcctat 
tacctcttag 
tggtaggaga 
tttaagactc 
ccctgtcctc 
ggcctccctt 
tataagctgc 
agtacaagga 
ttacaatacc 
gtgaagttca 
tccctggccc 
ctttactctt 
caatggcttt 
cttatcattg 
gttctgtcct 
ggaccaacgg 
gctcaataag 
gggagggagg 
gtgtgtctgt 
gtgtttggct 
tgtctgagtg 
gtgtgtgtct 
ggctcagcct 
ggaccttaag 
aacatgtgat 
gacagaaggt 
aattagtgtt 
ctggagctgg 
accactgagc 
attttactac 
taatcccaac 
agttcaacac 
tacaaaaatt 
agtaggctaa 
taagccgaga 
ctgtctcaaa 
tcacacctgt 
gaagtcaggc 
tacaaaaaaa 
ctagctgctg 
aaggctgcag 
agtgaggccc 
tggtggctca 
attgcttgag 
atctctacaa 
cctgtactaa 
cgggaggttg 
tgggcgatac 
aaaaacaccc 
gctcccctgt 
gcttcctggg 
gctgcccctc 
tacagccccc 
agaaggggca 
aaaagcaggc 
tgggtgctgg 
atgtcctggc 
accagctggt 
agtcagcttt 
ctccagaccc 
tgtatggctc 
aaaaaatcag 
aagaccacag 
agcttacttt 
ttatctttct 
tcttccttct 



ggggttctct 
caaggggtag 
cccctgtcag 
gaaaggaggc 
gtggccccag 
agattctggg 
gcgtgccagc 
gagtgttcat 
atgcctgctg 
ggggagtttc 
taaaaaggta 
gacattgtgc 
ttctcccaga 
agcatggtgg 
taatggggac 
atgatggtga 
tggacaccta 
aatttttaaa 
actttgggag 
cagcctgggc 
agcctggtgt 
ggcacaagaa 
tcatgccact 
aaataaaaga 
actctcaaca 
atttaagaca 
ttcaaaagtt 
gggaggctga 
tgagctgtga 
tgtctcaaaa 
ttcctgtaat 
cccaggagtt 
aaaataaaaa 
cagctacgag 
aggctgcagt 
agcaagaccc 
agtggggtca 
acaccagccc 
gacccccagc 
cttcaagact 
tcctctgtaa 
ccctaccagc 
aaggggaccc 
aggggcctga 
agtgacttca 
aggctcatta 
cttcaaaggt 
atggggacaa 
tggctagccc 
tccaagagct 
gcagacctgg 
acttctgggc 
cctggtctgt 
cctgcatcgt 



gacacagccc 
atctggcttg 
tgcctcgaca 
gaggagacca 
aactgacatg 
gccgcagcca 
tgcatctttg 
gtgggccgtg 
gtgtgggctg 
aaaggagaaa 
ggacatcctg 
tgagtgaaac 
tgaggtttcc 
ttgccggggc 
agagtttcag 
tggttggaca 
aaaatgctta 
aaattggctg 
gccaaggcgg 
aatatggtga 
ggtggcttgc 
tctcttgaac 
gcaacccagt 
taaataaaaa 
ctttgggagg 
tgcctaggca 
aatgagacat 
ggtgggagga 
ttgcatcact 
aaatttttca 
tccagcactt 
taagaccagc 
taaaaaatta 
agaggctaag 
gagccatgat 
tatctcaaaa 
gtagaacccc 
cagctctgca 
cttccctctg 
catgtccaga 
tgagtgagcc 
cccccagtcc 
ctagtagatc 
agtgctttct 
gggcccgctg 
gcaagaggac 
gtttccttta 
caccacccag 
attcagagaa 
gtgaacaatt 
aaggctaagt 
cacttcctgg 
cttctcttct 
actccacccc 
10701 cactccagct attacacaga atcgcgagaa tgttggatta ttcattttat 

10751 ttatgatgtt ttcttttttg taaaaataga gacaaggtct cactatgtgg 

10801 cccaggctgg tcttgaactc ctggcctcaa gcaatcctcg tgccttggcc 

10851 tcttacagtg ctgggattac agatgtgagc caccatgcct ggcccatttt 

10901 atttacttta aaaaaaaaat taggctgggc gcggtggctc acacctataa 

10951 ttccagcact ttgggaggcc aaggtgggca gatcaactga ggtcaggagt 

11001 taaagaccag cttggccacc tggggtcagg agtttgagac cagctactcc 

11051 ggaggctgag accggagaat tgcttgaacc caggaggtag aggttgcaat 

11101 gaactgagat catgccattg catgccagcc tgggcaacag agcaagactg 

11151 tctcaaaaaa aaaaaaaatt atgttttgtg ctcctgcttc ctgctttgta 

11201 agtcaaatca gtttaactgt tcaagtgtct tccttgcaaa cccccaagga 

11251 ctcaatgtgt gtcgcccttg actgatcccc ccgccccgtg acccagtggt 

11301 cctcagttcc aggttttccc acctaccctt cacccactgc ttatgtttat 

11351 aaaaacgggg taaatcaaat gttcgtgacc cagatcttat tctacatgca 

11401 gtggaaactt gtatgactta agctttttgg aaaagcagaa ccttttttcg 

11451 tggttcaaga aatcaaagtc ttcccgggag gtctttctgt aaatccagag 

11501 ctgcagatgt ttgaccgtgt tcagagaggg gcccttgtgc tgggtgaagt 

11551 ggatggggca cagcaggcaa tgggtgaaaa gcaggacaac ctggggccct 

11601 gggaggacca gggagggccc atgtctttga ctgttcatca gccggctgac 

11651 ttcctgtccg cctgtcgtct gctctgccca tccatccgta gtccttccgc 

11701 ctgtctctgc tggttgccgc tgtgctactc agctgtgtct gtctgtccgc 

11751 ctgactgtct gctctccttc aaGATGCCTT CCGTGCCTTC CATCAAGATC 

11801 TCAATTTTGT GCGCAAGTTC CTACAGCCCC TGTTGATTGG AGAGCTGGCT 

11851 CCGGAAGAAC CCAGCCAGGA TGGACCCCTC AATgtgagcc agagccctag 

11901 gagaggctca gcccctgagg gagggggatg gctggagggc tgggagacat 

11951 tgccacatgg ccaggagcag ctccctcggc attcgcccaa ggggatgcag 

12001 agccagggct gagcctgccc tcccctccca gggggcaggc agttgaaagt 

12051 gaagctgtag ggatgccctg agaagtccag ggctccagat ctggtttagc 

12101 caggcactcg tttggatccc gaggcaagct ccctccctgt tgtcgcccag 

12151 tgtccccatc aaaaggagga ttttgatgaa ctgatttctc tcctggctgt 

12201 agcgtcttac ccaccccata ccttttggga gggagaggag gcttcaccac 

12251 cagccagtgc tccagctcac accccgggct gggtactctt gtcacttcat 

12301 tcctctttgc ccacacccct tgggcctggc gatgggagga gcggctgggg 

12351 ctccaggaga atgggggtgg ggaggaattt cttccttggc tgatcggccc 

12401 ctctgctatg gcaaGCGCAG CTGGTCGAGG ACTTCCGAGC CCTGCACCAG 

12451 GCAGCCGAGG ACATGAAGCT GTTTGATGCC AGTCCCACCT TGTTTGCTTT 



12501 CCTACTGGGC CACATCCTGG CCATGGAGGT GCTGGCCTGG CTCCTTATCT 

12551 ACCTCCTGGG TCCTGGCTGG GTGCCCAGTG CCCTGGCCGC CTTCATCCTG 

12601 GCCATCTCTC AC.atn^rrr.r. agttctgtgt tgcagccacc ttaactgccc 

12651 aacagacgtg ggcccccatg catctgggca ttgtgaacat atttgctaaa 

12701 tgaatgaatg gacctatgaa aggatgaatg gatgaataaa cagatgaatg 

12751 agtgaacagt ctgaaggccc atcaggcatg tctgtgggtc aagctgcatt 

12801 ccagatgagc caagaagttc cttcttgaac agattccgat caagcacagg 

12851 gccactgagc cagaggctgc tgccctgcag cttcatgaca cttacgagcc 

12901 cctccacctc cctgggactc agttctcatc tgtaaaaaga ggacactggc 

12951 ccacaagggt cttgaaatgg agcattagca cgggggtacc ctgcaagctg 

13001 aaaggattca ctggggcccc aggccctggc gggctccgtc cttcccaaca 

13051 gcttctgacc ctgcctctct ccccaaGCTC AGTCCTGGTG TCTGCAGCAT 

13101 GACCTGGGCC ATGCCTCCAT CTTCAAGAAG TCCTGGTGGA ACCACGTGGC 

13151 CCAGAAGTTC GTGATGGGGC AGCTAAAGat gagggtgggg tgggtggtca 

13201 gccaggtgct gggtggcgct gggtctgccc aagtgtgtgg gcacagtcgg 

13251 gggcacagcc tgccctgaga gccccctcct cctccacagG GCTTCTCCGC 

13301 CCACTGGTGG AACTTCCGCC ACTTCCAGCA CCACGCCAAG CCCAACATCT 

13351 TCCACAAAGA GCCAGACGTG ACGGTGGCGC CCGTCTTCCT CCTGGGGGAG 

13401 TCATCCGTCG AGgtgggtgg ggagggacct ggacaacctc tggctgggcc 

13451 tgcagctgag ggggagctaa tgcactgggt ccccactctg cccctgacct 

13501 agcccctgat ctggcctcca ctctggctgg gccaagctct gcccccgtgt 

13551 ctttccttcc cacctcccaa cctgctgggg acgaccagcc cgcttgctag 

13601 aatctagagt tgcctttgac ccttggcccc agccagcccc gtgaccttgc 

13651 ccgggagaag gaggtggcct ggagagctgc tgtctccagc cgccgcctgt 

13701 ctccacagTA TGGCAAGAAG AAACGCAGAT ACCTACCCTA CAACCAGCAG 

13751 CACCTGTACT TCTTCCTGAg tgagtgtcca tctgtccttc tggggtgggg 

13801 gagtgcctgg gcctgcactg tcctccctgc tgtcctggac cactcccagc 

13851 cacttcctgg ggcggggcac gtctgtcagg tctccctggt catggcatcc 

13901 tcccagcctc tgcagtctgt acacactctc ccagcagcat gcctttgccc 

13951 cagctgtctc ccgtgcctgg gacaccttgc agccacgggc catcacagcc 

14001 ctgctgggag cttccccaag ccccacgtag aatttcttct tgccctcact 

14051 agagtggtcc ggagccctag agtctttggg cagttgttgg ggcggacaga 

14101 gtgaggactc aagtctggcc ctgacttgcg gtgaagggtg gtgggaggtg 

14151 gtggggtaag ggcagcctgg ggaggcttgg acacagaatt gggggtgata 

14201 tggggtcatt cagctggatg tgaccagcac caacgtccca ggggcattcc 

14251 tggagtaaca gagcccctca ctctggcgcc cactcacctt ggcagcccag 

14301 ccccactcct gaacactctc atgccccttc ttgcagTCGG CCCGCCGCTG 

14351 CTCACCCTGG TGAACTTTGA AGTGGAAAAT CTGGCGTACA TGCTGGTGTG 

14401 CATGCAGTGG GCGgtgagtg gggttgccca ggaccccggg catacggctg 

14451 ccgtggcagg aggtggtgcc tcgggggaca gtacctgccc atgaaggcaa 

14501 acagggtgca catgtgcgtg caacagtgtg gctcacatgt atgcgtgcaa 

14551 cagtgtggct cacatgtgtg cgcgcagcag gagagcgagt gtgcccgtga 

14601 ctgtacgtgt ggtggggggg ggttgaggaa cagggggggt gtgggtctct 

14651 ctcggtgagg gtgtcttccc aggaggagtt gctgggccga ctctgccagg 

14701 catctgtgtc cctggcaggg tcttccccaa cacaccctgc atgacacctt 

14751 cgtcactaaa atcagcctcg tgagctggca gggcaaggac cctgttcctt 

14801 tactcagctg agaaaaccag agagggtggt ggcctgtcct gggctctgag 

14851 gcaaatcagg cagaagggtt ggatgcctga ggtcctcctc ccacccacca 

14901 ggcctccaga cctccgggca cctggagacc tctcggtatc gcctctgccc 

14951 tcctctgcag GATTTGCTCT GGGCCGCCAG CTTCTATGCC CGCTTCTTCT 

15001 TATCCTACCT CCCCTTCTAC GGCGTCCCTG GGGTG CTGCT CTTCTTTGTT 

15051 GCTGTCAGgt atggcaggga gtggcgaggt caGacacagg cgacaggtga 

15101 cccccactgc agccccccac cagagcttcc cttttcccgt ctgcagaatg 

15151 gggccagtgg tactgcctcc ctggcttgct ggtggaatca cataaacaca 

15201 agcgtggcag gagcccaggg tcggtgggtt tagggagcgt ggcctggctt 

15251 gtaagtggcc cggtgggtgt cggagctgct ctggactcag cctcacagtg 

15301 gacactgctc cattcagatt ctttaaacac tggcaagggg gcgatggcca 

15351 caatcctatt gtacagataa ggaagtcaag gccacttggg gacagctgct 

15401 ctccagcctc cactcagggt gcctaagtgg tgagctggac ctagggcagt 

15451 gcccgagcct ccccacagGG TCCTGGAAAG CCAGTGGTTT GTGTGGATCA 

15501 CACAGATGAA CCACATCCCC AAGGAGATGG GCCACGAGAA GCACCGGGAC 

15551 TGGGTCAGCT CTCAGataaa cagcaggggt ggggcccatc ctgggtgggg 

15601 tggggggtcc cagctaggag ccagatggca aagcagggat gaggccctga 

15651 cggggctgcc aggtggggga tggtgccgtg gggtcaggga tctgcaacgg 

15701 cctcctcaca tgtgccccgc cggcttccgg raaCTGGCAG CCACCTGCAA 

15751 CGTGGAGCCC TCACTTTTCA GCAACTGGTT CAGCGGGCAC CTCAACTTCC 

15801 AGATCGAGCA CCAgtgagtg tgggtgctgg gggccagtgg gaggtgggga 

15851 gggggtcctg ggaggggatc ctgggagggg acccgtgggt ggggcctctc 
15901 tctggaatct cccacttcag gtgccagcat acgctcccca cccccagCCT 

15951 CTTCCCCAGG ATGCCGAGAC ACAACTACAG CCGGGTGGCC CCGCTGGTCA 

16001 AGTCGCTGTC TGCCAAGCAC GGCCTCAGCT ACGAAGTGAA GCCCTTCCTC 



CAGataaaac tgcagcccgg cccctctgtt 

16101 ctggtggctt ccccagggcc tatgcctacc cttgtccagg tcagcctcat 

16151 gctgagcccc cagggtccct gagcctttct gtccacgtcc catgcccttc 

16201 ctcccttccc cagccttcac gcacacagtg agaatttctg gagcacctac 

16251 tgcagactca caaacagcag tgcctgcggt gagcaggtct atgcaaacct 

16301 acccccaaag gctgagggaa aaaagctaac agatccagtt tctcagaagg 

16351 aaacacttaa cagggactca taaacagaag ccatgtctca gggccgggtg 

16401 cggtggctca cgcctgtaat tccagcactt ggggaggctg aggtgggcgg 

16451 atcacttgag gtcaggagtt cgagaccagc ctggccaaca tggtgaaacc 

16501 ccgtctctac taaaaaaaaa aaaaaaaaac aaaacaaaac aaaaattagc 

16551 tgggtgtggt ggcaggtgcc cataatccca gctacttggg aggctgaggg 

16601 aggagaatca cttgaactcg caggggcaga ggttgcagtg agctgagatt 

16651 gtgcctttgc agtccagcct gggcaacaga gcaagactct ctcaaaaaca 

16701 aacaaaaaaa ccatgtctca ggcagccaag agttgggaca tcccctcaca 

16751 cgccctctag aaagaaccct ctatatagca agcttttagg gtgaacccca 

16801 tgcaggtggt tcttatgaac ctggtgacca ctggaggtta gataagcgtc 

16851 tacaagagga ggttatctat gccatgagct tggcattcag ggtcaagcat 

16901 cggtcatcag acagttttgc ttgaagatgg cattgccctt gtagcaatgc 

16951 aggctctaga gagcttcctg ccctcttgga gctgatgttc cttccagcaa 

17001 aggaaacagc aagcaattaa aataacaaat aagtacatta cagaagatgg 

17051 gcaaaagaac aatgaaaagc ccctcagggt ggggacaggg gaggggaggg 

17101 gggcggccag gcaggggcgg cagtttctaa ataggtggta gggtgggcag 

17151 taittgacagg ctgacgtgtg agcagggaca gggaggaggg gagaggtctc 

17201 gccacaggga catctggcaa agagcgttca ggcagagggc acttgaccct 

17251 gaatgccaag ctcatggcat agatagccga ggcaggcatg caggcactca 

17301 gagaagggac acgcccggct tgcatcttgg aaagctgccc ctactgggaa 

17351 tgactggcgg gcaggagtcg aagtggaaaa ggagagcaga ggacactgca 

17401 gccatccagg cgaggggtga tggggctcag cccttgtggt caccttggag 

17451 gtggggaaca gaggccagat tccaggtctt atacctctgc gcctttgtac 

17501 acgctgttcc ccttacttgg ttgcccttcc ttcctgtgct ggtgttcaga 

17551 tgcccacttc tccttcatga tctctcccag cctgatgctc tgagcccctg 

17601 ccatttggca cagcccttta gagcgcctgg cacagggctt cctagcagat 

17651 tgttgacatt tctggctcca ctgcccaata tcaggcccaa gatcgggtgg 

17701 gcaggttcca cgtcctctct gtccttgggt tgcagcgccc agcaggaggc 

17751 agcaatggag aactgggtgc aggagggaca ggcccaccca ggctcatgcc 

17801 tggacttggc cttggctgcc ctccagctcc cctacccgac acccgtcacc 

17851 ccggtctaga ttccattcca gagaatgagc attcagctgt tctcccaacc 

17901 caccctccag cccgcatcgc tgcctgcccc cagggaaggg aacccacagg 

17951 gaatggggat ctccgctcac acttaccatg ggggatacag gggtgttagg 

18001 atcttgcaac tgagctccta acacccaccc ccactgccac ccccacctcc 

18051 caaGTCCCTG AAGAAGTCTG GTGACATCTG GCTGGACGCC TACCTCCATC 



18151 CCCGGCGGGA TCGATACCCC CACCCCTCCA CTGGCCAGCC 

18201 CTGCCTGCCC TCCTGGTACT GTTGTCTTCC CCTCGGCCCC 

18251 CTCACATGTG TATTCAGCAG CCCTATGGCC TTGGCTCTGG GCCTGATGGG 

18301 ACAGGGGTAG AGGGAAGGTG AGCATAGCAC ATTTTCCTAG AGCGAGAATT 

18351 GGGGGAAAGC TGTTATTTTT ATATTAAAAT ACATTCAGAT GTATTATGGA 

18401 GT 



FIG.2G 



WO 00/21557 

09/806088 

PCT/US99/23253 

1 CTTCGCTTCCCTCGGGGTCTTGCTCGGACCTCGGCCACCGCCTGGGATCC 50 

51 CCAGGACTCGTGCGTGCAGCATGGGCGGCGTCGGGGAGCCGGGACCGCGG 100 
1 MGGVGEPGPR 10 

101 GAGGGACCCGCGCAGCCGGGGGCACCGCTGCCCACCTTCTGCTGGGAGCA 150 
11 EGPAQPGAPLPTFCWEQ 27 

151 GATCCGCGCGCACGACCAGCCCGGCGACAAGTGGCTGGTCATCGAGCGCC 200 
28 IRAHDQPGDKWLVIERR 44 

201 GCGTCTACGACATCAGCCGCTGGGCACAGCGGCACCCAGGGGGCAGCCGC 250 
45 VYDISRWAQRHPGGSR 60 

251 CTCATCGGCCACCACGGCGCTGAGGACGCCACGGATGCCTTCCGTGCCTT 300 
61 LIGHHGAEDATDAFRAF 77 

301 CCATCAAGATCTCAATTTTGTGCGCAAGTTCCTACAGCCCCTGTTGATTG 350 
78 HQDLNFVRKFLQPLLIG 94 

351 GAGAGCTGGCTCCGGAAGAACCCAGCCAGGATGGACCCCTGAATGCGCAG 400 
95 ELAPEEPSQDGPLNAQ 110 

401 CTGGTCGAGGACTTCCGAGCCCTGCACCAGGCAGCCGAGGACATGAAGCT 450 
111 LVEDFRALHQAAEDMKL 127 

451 GTTTGATGCCAGTCCCACCTTCTTTGCTTTCCTACTGGGCCACATCCTGG 500 
128 FDASPTFFAFLLGHILA 144 

501 CCATGGAGGTGCTGGCCTGGCTCCTTATCTACCTCCTGGGTCCTGGCTGG 550 
145 MEVLAWLLIYLLGPGW 160 

551 GTGCCCAGTGCCCTGGCCGCCTTCATCCTGGCCATCTCTCAGGCTCAGTC 600 
161 VPSALAAFILAISQAQS 177 

601 CTGGTGTCTGCAGCATGACCTGGGCCATGCCTCCATCTTCAAGAAGTCCT 650 
178 WCLQHDLGHASIFKKSW 194 

651 GGTGGAACCACGTGGCCCAGAAGTTCGTGATGGGGCAGCTAAAGGGCTTC 700 
195 WNHVAQKFVMGQLKGF 210 


FIG.3A 
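
The residue numbering in the listing above can be cross-checked with a short script. The following is a minimal sketch, not part of the original figure: it assumes the standard genetic code and assumes that translation starts at the ATG at nucleotide 71, as the listing's numbering implies. The names `CODON_TABLE`, `translate` and `CDNA_1_100` are hypothetical and introduced only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the listing uses the standard code; '*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cdna: str, start: int) -> str:
    """Translate cdna from the 1-based position `start` until a stop codon
    or until fewer than three bases remain."""
    protein = []
    for i in range(start - 1, len(cdna) - 2, 3):
        residue = CODON_TABLE[cdna[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Nucleotides 1-100 of the cDNA listing, copied from the figure.
CDNA_1_100 = (
    "CTTCGCTTCCCTCGGGGTCTTGCTCGGACCTCGGCCACCGCCTGGGATCC"
    "CCAGGACTCGTGCGTGCAGCATGGGCGGCGTCGGGGAGCCGGGACCGCGG"
)

print(translate(CDNA_1_100, 71))  # residues 1-10 of the listing
```

Run on the first 100 nucleotides, the sketch should reproduce the first ten residues printed under line 51 of the listing (MGGVGEPGPR); the same function can be applied to the full cDNA to check the remaining residue numbers.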




701 TCCGCCCACTGGTGGAACTTCCGCCACTTCCAGCACCACGCCAAGCCCAA 750 
211 SAHWWNFRHFQHHAKPN 227 

751 CATCTTCCACAAAGACCCAGACGTGACGGTGGCGCCCGTCTTCCTCCTGG 800 
228 IFHKDPDVTVAPVFLLG 244 

801 GGGAGTCATCCGTCGAGTATGGCAAGAAGAAACGCAGATACCTACCCTAC 850 
245 ESSVEYGKKKRRYLPY 260 

851 AACCAGCAGCACCTGTACTTCTTCCTGATCGGCCCGCCGCTGCTCACCCT 900 
261 NQQHLYFFLIGPPLLTL 277 

901 GGTGAACTTTGAAGTGGAAAATCTGGCGTACATGCTGGTGTGCATGCAGT 950 
278 VNFEVENLAYMLVCMQW 294 

951 GGGCGGATTTGCTCTGGGCCGCCAGCTTCTATGCCCGCTTCTTCTTATCC 1000 
295 ADLLWAASFYARFFLS 310 

1001 TACCTCCCCTTCTACGGCGTCCCTGGGGTGCTGCTCTTCTTTGTTGCTGT 1050 
311 YLPFYGVPGVLLFFVAV 327 

1051 CAGGGTCCTGGAAAGCCACTGGTTCGTGTGGATCACACAGATGAACCACA 1100 
328 RVLESHWFVWITQMNHI 344 

1101 TCCCCAAGGAGATCGGCCACGAGAAGCACCGGGACTGGGTCAGCTCTCAG 1150 
345 PKEIGHEKHRDWVSSQ 360 

1151 CTGGCAGCCACCTGCAACGTGGAGCCCTCACTTTTCACCAACTGGTTCAG 1200 
361 LAATCNVEPSLFTNWFS 377 

1201 CGGGCACCTCAACTTCCAGATCGAGCACCACCTCTTCCCCAGGATGCCGA 1250 
378 GHLNFQIEHHLFPRMPR 394 

1251 GACACAACTACAGCCGGGTGGCCCCGCTGGTCAAGTCGCTGTGTGCCAAG 1300 
395 HNYSRVAPLVKSLCAK 410 

1301 CACGGCCTCAGCTACGAAGTGAAGCCCTTCCTCACCGCGCTGGTGGACAT 1350 
411 HGLSYEVKPFLTALVDI 427 

1351 CGTCAGGTCCCTGAAGAAGTCTGGTGACATCTGGCTGGACGCCTACCTCC 1400 
428 VRSLKKSGDIWLDAYLH 444 

1401 ATCAGTGAAGGCAACACCCAGGCGGGCAGAGAAGGGCTCAGGGCACCAGC 1450 
445 Q 445 

1451 AACCAAGCCAGCCCCCGGCGGGATCGATACCCCCACCCCTCCACTGGCCA 1500 

1501 GCCTGGGGGTGCACTGCCTGCCCTCCTGGTACTGTTGTCTTCCCCTCGGC 1550 

1551 CCCCTCACATGTGTATTCAGCAGCCCTATGGCCTTGGCTCTGGGCCTGAT 1600 

1601 GGGACAGGGGTAGAGGGAAGGTGAGCATAGCACATTTTCCTAGAGCGAGA 1650 

1651 ATTGGGGGAAAGCTGTTATTTTTATATTAAAATACATTCAGATGTAAAAA 1700 

1 GTACAGCGGCAATGGGCGGTGTCGGGGAGCCCGGAGGGGGACTCGGGCCG 50 
1 MGGVGEPGGGLGP 13 

51 CGGGAGGGGCCCGCACCGCTGGGGGCGCCCCTACCCATCTTCCGCTGGGA 100 
14 REGPAPLGAPLPIFRWE 30 

101 GCAGATCCGCCAGCATGACCTACCAGGCGACAAGTGGCTGGTCATCGAGC 150 
31 QIRQHDLPGDKWLVIER 47 

151 GCCGTGTCTACGACATCAGCCGCTGGGCACAGCGGCACCCAGGGGGTAGC 200 
48 RVYDISRWAQRHPGGS 63 

201 CGCATCATCGGCCACCACGG 220 
64 RIIGHH 69 



FIG.4 






PROFILESCAN of: CYB5rp_correct_protein check: 5714 from: 1 to: 445 

GETSEQ from bmd. December 2, 1997 14:20. 

Compare to profile library: GenRunData:profilescan.fil 



Profile: profiledir:cytochrome_b5.prf 
Gap weight: 4.50 Gap Length weight: 0.05 
Ave match: 0.27 Ave mismatch: -0.21 
(Peptide) PROFILEMAKE v4.40 of: 0191.Msf2{*} Length: 48 

Sequences: 24 MaxScore: 27.58 December 2, 1992 00:07 
This profile is derived from PROSITE release 10.0 and has been tested 
by a database search against SWISS-PROT release 26.0. A comparison 
of the SWISS-PROT annotation and the results of the database search follows. 
For further information about this motif, consult the . . . 

Profile: profiledir:cytochrome_b5.prf alignment: 1 

Quality: 20.77 Gaps: 0 
Ratio: 0.43 Length: 48 
Normalized quality: 2.91 

S 31 HDQPGDKWLVIERRVYDISRWAQRHPGGSRLIGHHGAEDATDAFRAFH 78 

I: ..: Nil- .|||:::| • llll- I .||.|:||.| ::| 
P 1 HNDGEETWLWNGQVYDITKFLEEHPGGPDVIMEAAGTDATEEFEAIH 48 

****************************************** ********** 
* Cytochrome b5 family, heme-binding domain signature * 
***************************************** ********** 
pir:s68358 hypothetical protein - common sunflower 
Length = 458 

Score = 169 (79.4 bits), Expect = 2.8e-42, Sum P(4) = 2.8e-42 
Identities = 31/85 (36%), Positives = 49/85 (57%) 



Query: 348 IGHEKHRDWVSSQLAATCNVEPSLFTNWFSGHLNFQIEHHLFPRMPRHNYSRVAPLVKSL 407 

-f€K+W Q T44S + +WFGL FQ+EHHLFPR+PR + ++P+ + L 
Sbjct: 348 VGPPKGDNWFEKQTRGTIDIACSSWMDWFFGGLQFQLEHHLFPRLPRCHLRSISPICREL 407 



Query: 408 CAKHGLSYEVKPFLTALVDIVRSLK 432 

C K+ L Y F A V +++L+ 
Sbjct: 408 CKKYNLPYVSLSFYDANVTTLKTLR 432 

Score = 133 (62.5 bits), Expect = 2.8e-42, Sum P(4) = 2.8e-42 
Identities = 21/53 (39%), Positives = 35/53 (66%) 

HPGG motif 

Query: 26 EQIRAHDQPGDKWLVIERRVYDISRWAQRHPGGSRLIGHHGAEDATDAFRAFH 78 

++++ H+ P D W+ I +VY+++ WA+ HPGG + + +D TDAF AFH 
Sbjct: 22 KELKKHNNPNDLWISILGKVYNVTEWAKEHPGGmiNLAGQDVTDAFIAFH 74 



Score = 118 (55.5 bits), Expect = 2.8e-42, Sum P(4) = 2.8e-42 

Identities = 25/76 (32%), Positives = 34/76 

His box 1 His box 2 

Query: 165 LAAFILAISQAQSWCLQHDLGHASIFKKSWWNHVAQKFVMGQLKGFSAHWWNFRHFQHHA 224 

L+ IL ++ Q L HD GH + WN A F+ + G S WW + H HH 
Sbjct: 152 LSGAILGLAWMQIAYLG HDAGHf lTQMMATRGWNKFAGIFlGNClTGISlAWWKViTHNA^j-II 211 

Query: 225 KPNIFHKDPDVTVAPV 240 

N DPD+ P+ 
Sbjct: 212 ACNSLDYDPDLQHLPM 227 



Score = 34 (16.0 bits), Expect = 2.8e-42, Sum P(4) = 2.8e-42 
Identities = 7/14 (50%), Positives = 9/14 (64%) 



FIG.7A 






gp:bou79010_1 PID:g2062403 Borago officinalis delta 6 desaturase mRNA, 
complete cds. (gb:U79010) (NID:2062402) 
Length = 448 

Score = 179 (84.1 bits), Expect = 2.3e-42, Sum P(3) = 2.3e-42 
Identities = 34/87 (39%), Positives = 48/87 (55%) 

His box 3 

Query: 348 IGHEKHRDWVSSQLAATCNVEPSLFTNWFSGHLNFQIEHHLFPRMPRHNYSRVAPLVKSL 407 

-fG K +W Q T ++ + +WF G L FQIEHHLFP+MPR N -K+P V L 
Sbjct: 338 VGKPKGNNWFEKQTDGTLDISCPPWMDWFHGGLQFQIEHHLFPKMPRCNLRKISPYVIEL 397 

Query: 408 CAKHGLSYEVKPFLTALVDIVRSLKKS 434 

C KH L Y FA +R+L+ + 
Sbjct: 398 CKKHNLPYNYASFSKANEMTLRTLRNT 424 



Score = 144 (67.7 bits), Expect = 2.3e-42, Sum P(3) = 2.3e-42 
Identities = 23/53 (43%), Positives = 36/53 (67%) 

HPGG MOTIF 

Query: 26 EQIRAHDQPGDKWLVIERRVYDISRWAQR|HPGG|SRLIGHHGAEDATDAFRAFH 78 

++++ HD+PGD W+ I+ + YD+S W + HPGGS + ++ TDAF AFH 
Sbjct: 12 DELKNHDKPGDLWISIQGKAYDVSDWKD|hPGG|sFPLKSLAGQEVTDAFVAFH 64 



Score = 105 (49.3 bits), Expect = 2.3e-42, Sum P(3) = 2.3e-42 
Identities = 22/68 (32%), Positives = 28/68 (41%) 

His box 1 His box 2 



Query: 176 QSWCLQHDLGHASIFKKSWWNHVAQKFVMGQLKGFSAHWWNFRHFQHHAKPNIFHKDPDV 235 

QS + HQ GH + S N F L G S WW + H HH N DPD+ 
Sbjct: 153 QSGWIGkDAGHffMVVSDSRLNKFMGIFAANCLSGISIGWWKWlN lHNAHHl IACNSLEYDPDL 212 



Query: 236 TVAPVFLL 243 
P ++ 

Sbjct: 213 QVIPFLVV 220 



FIG.7B 
pir:s35157 Delta(6)-desaturase - Synechocystis sp. 
Length = 359 

Score = 126 (59.2 bits), Expect = 9.0e-09, Sum P(2) = 9.0e-09 
Identities = 21/54 (38%), Positives = 33/54 (61%) 

His box 3 

Query: 372 FTNWFSGHLNFQIEHHLFPRMPRHNYSRVAPLVKSLCAKHGLSYEVKPFLTALV 425 

F NMF G LN Q+ mLFP + +Y ++ ++K +C + G+ Y+V P A + 
Sbjct: 292 FWNWFCGGLNhjQViaHLFPNICHIHYPQLENI IKDVCQEFGVEYKVYPTFKAAi 345 

Score = 36 (16.9 bits), Expect = 9.0e-09, Sum P(2) = 9.0e-09 
Identities = 6/15 (40%), Positives = 8/15 (53%) 



Query: 209 GFSAHWWNFf 
G S+ W +F 
Sbjct: 113 GLSSFLWRYF 





His box 2 
1. Heart 6. Skeletal Muscle 

2. Brain 7. Kidney 

3. Placenta 8. Pancreas 

4. Lung 9. Retina 

5. Liver 